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A fractionally spaced equalizer is a nonrecursiue adaptive filter 
whose tap weights are spaced a fraction of a symbol interval apart. 
Such an equalizer can significantly enhance modem performance in 
the presence of severe linear distortion, when compared with a 
conventional synchronous equalizer whose taps are spaced a symbol 
interval apart. However, a digitally implemented, fractionally spaced 
equalizer generally will exhibit long-term instability when the con- 
ventional tap-adjustment algorithm is used. This occurs because, in 
contrast to the synchronous equalizer, a fractionally spaced equalizer 
generally will have many sets of tap values, which result in nearly 
equal values of mean-squared error (mse). Some of these tap set- 
tings — which invariably will be attained because of biases in the 
digital tap-updating circuitry— are large enough to cause register 
overflows and consequent performance deterioration. In this paper 
we report how a simple modification in the tap-adjustment algorithm 
provides a solution to the above problem. The modified tap-adjust- 
ment algorithm prevents the buildup of large coefficient values by 
systematically "leaking" or decreasing the magnitudes of all the 
equalizer tap weights. For an experimental modem operating at 9.6 
kb/s, it has been demonstrated that the tap-leakage adjustment 
algorithm prevents the accumulation of large equalizer tap values, 
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while permitting the full performance gain of a fractionally spaced 
equalizer to be realized. 

I. INTRODUCTION 

Fractionally spaced equalizers (fses), which are nonrecursive, 
tapped, delay-line adaptive filters, are currently receiving much atten- 
tion 1 " 7 because of the significant performance advantage they provide 
when compared with a conventional synchronous equalizer. Since an 
fse has the capability to adaptively realize the optimum linear re- 
ceiver, it can greatly improve the performance of a modem in the 
presence of severe linear distortion. More specifically, significant per- 
formance improvements have been observed owing to the ability of an 
fse to compensate effectively for delay distortion at the limits of 
private-line, voice-grade channel conditioning. 8 However, in laboratory 
experiments with a digitally implemented fse it was noticed that after 
an extended period of operation some of the equalizer tap weights 
would invariably become large, while the mean-squared error (mse) 
remained at a satisfactory level. The taps generally would become so 
large that one or more registers, which compute partial sums of the 
equalizer output, would overflow, and the modem performance was 
then substantially degraded. This phenomenon is a consequence of the 
fact that an fse, in contrast to a conventional synchronous equalizer, 
generally has many sets of tap values that correspond to roughly the 
same mse. Included in the set of tap values that correspond to the 
minimum mse are some tap coefficients of relatively large magnitude. 
These large tap values can be attained because of the cumulative effect 
of noise or any bias in the digital circuitry that performs the equalizer 
updating. Even though the value of the mse is satisfactory, some of 
these tap values will be large enough to cause occasional overflow of 
the partial sums computed to form the equalizer output. The purpose 
of this paper is twofold: to elucidate why a fractionally spaced equalizer 
has so many apparently "good" sets of tap values, and to indicate how 
equalizer operation can be stabilized by simply modifying the conven- 
tional estimated-gradient tap-adjustment algorithm. 

The "almost unique" nature of the fractionally spaced equalizer 
coefficients is discussed in Section II. In Section III we describe the 
reasons for the occurrence of large tap values, and in Section IV a 
modified adjustment algorithm, dubbed the tap-leakage algorithm, is 
proposed to remedy the observed equalizer instability. The results of 
laboratory experiments are reported in Section V. 

II. DOES A FRACTIONALLY SPACED EQUALIZER HAVE A UNIQUE 
OPTIMUM SETTING? 

2. 1 Fractionally spaced equalizers 

To answer the question posed by the title of this section, we refer to 
the simplified baseband data transmission system shown in Fig. la. 

1818 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1 982 





3 

4 



Sh 



Mi-. 

5 

+ 



•do 

s i 

< 5 

— — 

a 

i! 

g s 

■w c 
S o 
P.-S 



■i 



a£g. 

§£g< 

£ § >> 

ai C C 

in«.2 
23 jB-S 

5T O CB 
I &* 

i.So 

bb £ .; 

— D J) 

3? a 



TAP-LEAKAGE ALGORITHM 1819 



For the purposes of exploring the phenomenon of large-tap buildup, 
this baseband model will suffice. Referring to the figure: {a n } are the 
discrete-valued multilevel data symbols, 1/T is the symbol rate, p(t) 
is the band-limited transmitter pulse (whose spectrum is shown in Fig. 
lb), f(t) is the channel impulse response, and v(t) is the additive 
background noise. Note that the receiving filter output, r{t), is sampled 
at the rate 1/T', and the samples are then passed through the tapped 
delay-line equalizer (shown in Fig. lc) having (2N + 1) delay elements 
spaced T'(<T) seconds apart and weighting coefficients {c„}. The fse 
output 



q(nT) = £ c m r(nT - mT), n = l,2, 

m=-JV 



(1) 



is computed at the symbol rate and quantized (sliced) to provide the 
data decision, d n . The transmitted pulse spectrum, shown in Fig. lb, 
generally will be band-limited to (1 -I- 0L)m/T radians/second where the 
rolloff factor, a, varies between and 1. An fse with tap spacing T' 
seconds will have a transfer function, Ct, (w), with period 2tt/T', and, 
as shown in Fig. 2, if T' < T/(l + a), the transfer function of the fse 
will span the entire spectral range of the transmitted signal. This 
enables the equalizer to exert complete control over the amplitude and 
delay distortion present in the region < |o>| < (1 + a)ir/T. Conse- 
quently, the fse can compensate for these distortions directly, rather 
than filtering the aliased (folded) spectrum, as is done by the conven- 
tional equalizer. 9 As an example of this feature, consider the ability of 
the fse to compensate for delay distortion by synthesizing the phase 
characteristic conjugate to that of the received pulse. This operation 
will leave the noise power at the equalizer output unchanged from the 
received noise power. The conventional synchronous equalizer, be- 
cause it has a transfer function that has period 2tt/T, can only equalize 
the folded spectrum, i.e., the compensation characteristics on either 
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Fig. 2 — Transfer function of a fractionally spaced equalizer. 
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side of tt/T are restricted to be the conjugate of one another. If the 
channel characteristics are such that the folded spectrum has a relative 
null at a particular frequency, then the noise at the output of the 
synchronous equalizer can be significantly enhanced and the perform- 
ance degraded proportionately. 

From Fig. 2 it should also be evident that when the noise becomes 
vanishingly small, there is legitimate concern as to what function(s) 
the equalizer will synthesize in the region, (1 + a)ir/T < |w| < Itt/T, 
where there is no signal energy. It is known 7,8 that for an infinitely 
long equalizer the transfer characteristic that minimizes the mse is 

C r , (w)= -^ , M<— , (2) 



2*7 

+ 1-^ 

T 



T 



where the asterisk denotes the complex conjugate, X(u) is the transfer 
function of the received pulse, f x(t), presented to the equalizer input, 
and No is the noise spectral density. As long as N ¥> 0, the equalizer 
function is zero whenever the received signal has no power; however, 
as No -* the derivation of (2) is no longer valid, and, moreover, (2) 
approaches 0/0 in the region (1 + a)n/T < | w| < it IT. Clearly, as the 
noise vanishes, an infinitely long fse can synthesize the required 
channel characteristic in the region < |w| < (1 + a)v/T, and an 
arbitrary — and nonunique — characteristic in the remaining frequency 
band. An interesting question, then, is what happens to the optimum 
tap setting for a finite length fse as the noise becomes vanishingly 
small. 

2.2 Uniqueness of solution for finite length FSE as the noise vanishes 

The equalized mse is defined as 

E = ([q(nT) - a n f) = (el), (3) 

where the brackets denote the ensemble average with respect to the 
data symbols and the noise, and e„ is the equalizer output error, q(nT) 
- a nt at t = nT. The mse is readily evaluated as the quadratic form 

E = c'Ac - 2c'x + (al), (4) 

where the prime denotes the transposed vector, c' is the tap vector 
(c-n, • • • , Co, • • • , c N ), x' is the truncated impulse-response vector 
\x(NT'), ••• , x(-NT')\, and A is the channel-correlation matrix. 
More specifically, the channel vector is given by 

x = <a„r„), (5) 



+ Thus x(t) is the convolution of the transmitter, channel, and receiver filters. 
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where the received vector is given by r^ = | r(nT + NT'), • • • , r(nT), 
r(nT — NT') \ , and the A matrix is defined by 

A = <r„r;). (6) 

The kith element of the channel correlation matrix is given by 

00 

A kl = £ x{mT - kT')x{mT - IT) + o 2 8 k -i, (7) 

m— — oo 

where a 2 is the noise variance and 8k is the Kronecker delta. Note that 
A is not a Toeplitz matrix, as it would be for a synchronous equalizer 
(T' = T). If A is nonsingular then the optimum setting and the 
corresponding minimum mse are obtained from (4) by differentiation, 
and are given by 

c op t=A -1 x (8a) 

E opt = 1 - x'A-'x, (8b) 

where {al) is taken to be unity. 

As seen from (7), the matrix A is the sum of two matrices, and as 
will be evident from the discussion that follows, the channel-dependent 
component of A is always positive semidefinite. Since the other com- 
ponent of the channel-correlation matrix, a 2 !, is positive definite, then 
A will also be positive definite, and we can conclude that when there 
is noise present, the optimum tap setting is unique. 

We now consider the situation as the noise becomes vanishingly 
small; clearly, the optimum tap setting will be unique if, and only if, A 
is nonsingular. A sufficient condition for A to be nonsingular is the 
nonvanishing of the quatratic form u'i4u, for any nonzero test vector 
u with components [«,-]. Let us consider in detail this quadratic form, 
which we write from (7) as: 

N 

U'AU = £ UnAmnUn 
m,n—N 

N a, 

= YZ u m u n £ x(lT - nT')x(lT - mT) 

m,n=—N /— — oo 



r n i 2 

I u m x(lT-mT') 



l o 



2=0. (9) 



The above inequality establishes the positive semidefinite nature of 
the matrix A, and we see from (9) that u'Au can vanish only if* 



* The authors gratefully acknowledge discussions with J. E. Mazo that led to this 
development. 
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£ u m x(lT - mT) = 1 = 0, ±1, ±2, 

m — N 



(10) 



If we define the periodic Fourier transform 

N . T . IT 

U't(<»)= 2 ««^ r , l«l ^™> 

then we can proceed further by noting that 



(11) 



N 



I u m x(lT - mT) = £ "« X< u )e""-"" — 
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*■ (12) 



The right-hand side of (12) is recognized as the sample, at t = IT, of a 
function whose Fourier transform, Zeq(co), is contained in the brackets. 
If (12) is to be zero for every value of I, then it must be that the Fourier 
transform inside the integral vanishes completely, i.e., 



In Fig. 3a we show the situation when there is no excess bandwidth, 
and since the sum, (13), reduces to one term, the only way for 
Ze q (w) = is for either X(u) ■ or U T -(u) = 0. Since this implies that 
Ut-(co) = 0, it would violate the nonzero requirement on u. Thus, we 
can conclude for this case that A is positive definite. A similar sketch 
for the less than 100-percent excess bandwidth case is shown in Fig. 
3b, where it is noted that only the k = 0, ±1 terms contribute to the 
sum, (13). However, in the nonrolloff region, |w| < (1 - a)w/T, only 
the k = term influences the sum. For channels that do not vanish 
over the entire nonrolloff region, it is clear that for Z eq (u>) to vanish it 
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CONDITIONS FOR WHICH Z eq (U) = £ U r ((ti + *-y") x [u + ^j?) = 0, (w) < j- 
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Fig. 3— Sketches associated with eq. (13). (a) a = 0. (b) < a < 1. (c) a = 1. 

is required that Ut{u) vanish at least over the entire nonrolloff region. 
Since Ut'(u) is a finite-term Fourier series, it cannot vanish over an 
interval without vanishing everywhere, which in turn would again 
make u = 0. Note that if the channel vanished over a portion of the 
nonrolloff region, then since Z eq (w) is a finite- term Fourier series, its 
energy could not be totally concentrated in the region where there was 
no channel energy. Thus, the solution still would be unique. It is worth 
noting that in the extreme case of 100-percent excess bandwidth, Z eq (o>) 
can vanish. For example, in Fig. 3c we sketch the situation for a 
constant X(to), and with Ut{u>) = cos toT/2 it is apparent that 
Zeq(w) ■ 0. Thus, for a finite-length fse with an excess bandwidth of 
less than 100 percent, we can conclude that even as the noise becomes 
vanishingly small, the A matrix is nonsingular and there is a unique 
optimum tap setting. 

We digress for a moment to point out that for a finite-length 
synchronous equalizer where T' = T, (13) indicates that since 
Ut(u + hlm/T) = Ut(o>), we can conclude that if the folded channel 
spectrum does not vanish completely, then there is always a unique 
tap setting. 

III. THE TAP-WANDERING PHENOMENON 

3. 1 Motivation, background, and infinite-precision considerations 

We have shown that for a finite-length fractionally spaced equalizer 
and the practical range of interest — where the excess bandwidth is on 
the order of 10 to 50 percent — even with vanishingly small noise there 
will always be a unique best tap setting. One issue of interest is the 
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"closeness" (or ill-conditioning) of the A matrix to a singular matrix. 
This is important for two reasons. First, the distribution of the eigen- 
values of A, which is a measure of the ill-conditioning of the matrix, 
influences the rate of convergence of the equalizer taps to their 
optimum setting. 10 Second, and more importantly, we observe, from 
(4) that the contours of equal mse are elliptical, and the eccentricity of 
these contours is directly related to the eigenvalue distribution. In the 
appendix it is shown that for an infinitely long equalizer with T = 
T/2, half the eigenvalues are zero; in Fig. 4 we illustrate some constant 
mse contours for a finite-length T/2 equalizer, whose optimum tap 
setting is denoted by c opt . Recall 10 that even with an infinite-precision 
analog implementation, the use of a finite step-size in the conventional 
estimated-gradient tap-adjustment algorithm results in a steady-state 
mse that exceeds* E opi . This is depicted in Fig. 4, where the boldface 
contour is the mse that can be attained with the chosen step-size. 
Owing to the random component in the algorithm's correction term, 
the taps will wander along the constant mse contour, and there will be 
a certain probability that the taps will become so large that one or 
more registers will saturate. 1 Thus, even in an analog implementation, 
random tap wandering can, in principle, lead to degraded performance. 

3.2 A model for tap drifting in digital equalizer 

It has been observed in laboratory experiments with a digitally 
implemented fse, that under control of the conventional estimated- 
gradient tap adjustment algorithm, 10 

c„+i = c„ - a[e„r n ], n = 1, 2, 3, • • • (14) 

the equalizer taps inevitably drift close to the shaded (large tap) region 
of Fig. 4. In (14), c is the tap vector at t = nT, a is a positive value 
called the step-size, which influences both the convergence rate of the 
equalizer and the steady-state mse, and the brackets around e„r„ 
indicate that this increment is quantized to a specified number of bits. 
The term [e„r n ] will have a deterministic component proportional to 
the desired gradient, and a random component owing to both the 
manner in which the digital quantization is performed and the influ- 
ence of the noise and data-dependent terms. Generally, [e„r„] will also 
possess a deterministic component owing to bias inevitably present in 
a digital implementation. A typical mechanism for such a bias is the 
two's complement type of quantizing characteristic shown in Fig. 5. 
To quantify our discussion, we denote the bias by a time-invariant 



* The mse E„ pl is achieved when the taps are at their optimum values, c opl . 
+ For a synchronous equalizer the ellipses will not be very eccentric, and the tap 
wandering will not do any damage. 
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Fig. 4 — Contours of equal mse and tap convergence. 



vector, b, and model the adjustment algorithm as 



c„+i = c„ - a(e n r n + b), 



(15) 



where the bias vector has equal components, and b has a magnitude of 
less than half a quantization interval. Equation (15) ignores, except for 
the bias, the effect of limited precision on the algorithm.* Since it has 
been observed in the laboratory that tap wandering results in a 
systematic buildup of some tap values, the model expressed by (15) 
should be useful in relating the magnitude of the bias to the other 
system parameters. It is bias component that can drive, in a determin- 
istic manner, + the tap vector towards the tap region corresponding to 
large tap values. Since the equalizer output is formed as a series of 
partial sums, of the form £m c m r n - m , it is clear that large-tap values can 
lead either to an overflow of a partial sum or saturation of a tap. As 
the taps grow, occasional register overflows begin to occur, resulting in 
noise-like "hits" on the equalizer output. The occurrence of such a 
"hit" is a function of the specific pattern of data samples contained in 
the equalizer. Continued growth of the taps increases the frequency of 
the "hits" as more data patterns can produce these events. The error 
rate can become very high, relative to what constitutes acceptable 
performance, but the frequency of occurrence of "hits" is still low 



* Reference 10 discusses the effect of limited precision on the mse or an fse in the 
absence of a bias term. Note that the bias as "seen" by the taps is ab. 

* As opposed to the random wandering associated with the self-noise of the algorithm. 
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Fig. 5 — Quantizing characteristic that produces a bias in the tap-adjustment algo- 
rithm. 

enough that the mse is almost unaffected. Growth continues until the 
tap coefficients themselves begin to saturate. At this point severe 
degradation occurs as the degrees of freedom of the system are reduced. 
In fact, with an experimental digitally implemented fse, with a symbol 
rate of 2400 symbols/second, overflows typically begin to occur within 
several minutes of operation. The overflows produce noise-like hits 
and a degraded equalizer output. 

3.3 The mean tap error and the mean-squared error 

To assess the effects of the bias quantitatively we first define the 
tap-error vector, 

€ n = C n - Copt, (16) 

and then use the model described by (15) to write the tap-error 
evolution as 

e„+i = €„ - a(e„r„ + b). (17) 

The mse at the nth iteration,* E„, is given by 

E n = E opt +(€ , n A€ n ). (18) 

Our intent is to study the excess mse, 

q n m(^A€n), (19) 

and the mean tap error vector, <e„), in the presence of the bias, b. 



* In arriving at (18), we assume that iterations are infrequent enough so that 
successive vectors {r„} are independent. In practice, adjustments are generally made at 
the symbol rate, and the algorithm is observed to behave as if the {r„} were independent. 
This phenomenon is discussed in Ref. 11. 
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From (17) we have 

<€„ + i> - <€„> - a(e n r n ) -ab=(I- aA)(€ n ) - ah, (20) 

and thus the steady-state mean tap error satisfies 

(e)=A- 1 b. (21) 

If A, and p„ respectively, denote the ith eigenvalue and eigenvector of 
A, then 

<€> = S^p i . (22) 

Clearly, if there is a small eigenvalue whose eigenvector is not orthog- 
onal to b, then the steady-state tap error can be quite large. It is 
interesting to note that for a small number of taps, one would expect 
that the eigenvalues not be small, i.e., the equalizer would not possess 
enough degrees of freedom to realize a somewhat arbitrary transfer 
function beyond the rolloff region. Consider the one-tap equalizer (or 
automatic gain control) where A = (r 2 (nT)), and consequently 

«> = <7W < 23 > 

Thus, for a one-tap equalizer, the tap error is directly proportional to 
the magnitude of the bias, and the buildup of a large tap value is 
prohibited. However, in the limit as the number of taps becomes 
infinite, it is shown in the appendix that with T' = T/2, half the 
eigenvalues are zero, while the other half tend to uniformly sample the 
aliased (with respect to the symbol rate) squared magnitude of the 
channel transfer function. Moreover, the eigenvectors corresponding 
to the zero eigenvalues have most of their energy concentrated near 
1/T Hz (and are thus close to being orthogonal to b), while the ith 
eigenvector corresponding to the nonzero eigenvalues approaches a 
sinusoid of radian frequency o>, = i/N tt/T. For practical, finite-length 
equalizers, these limiting conditions will only be approximated, and 
there will be small eigenvalues whose corresponding eigenvector is not 
orthogonal to b. Consequently, (e) can become as large as the largest 
ratio [pJb]/A„ and the steady-state tap error would then be biased 
away from the optimum value. 

We now discuss the effect of the bias term on the equalized mse, 

E n - -Eopt + (€' n A€ n ) , (24) 

where the tap error evolves according to (17). In particular we will 
examine the size of the residual mse, 

q n = (e n A€n). (25) 
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An exact analysis (or tight bounds) of the behavior of q n is an extremely 
difficult problem; however, by assuming that the sequence {r„} is 
independent," and that when the taps are at or near their optimum 
settings, the squared output error is relatively insensitive to the trans- 
mitted data pattern,* it is possible to establish simple, but useful, 
relationships between the relevant system parameters. From (17) we 
have that 

q n+ i = Wn+iAen+x) = <[€'„ - a(e n r' n + b')]A[e n - a(e„r„ + b)]> , (26) 

and by using the above assumptions we have (see Ref. 10) 
q N+i m [1 - 2a\ + a 2 X M (2N + l)(r 2 „)]q n 

+ a 2 X M (2N + l)((r 2 n)E opl + b 2 ), (27) 
where X M is the maximum eigenvalue of A, and where 

A=— 1— £A, (28) 

2N +l- N 

is the average eigenvalue. Thus, the steady-state fluctuation about the 
minimum mse is 



m aX M (2N + l)[(r 2 (nT))E opt + b 2 ] 
Qm ~ 2X-aX M (2N+l)(r 2 (nT)) 



(29) 



To assess the effect of the bias on q m , we note that the bias "seen" 
by a tap component, ab, will be approximately 2 _fl C ma x, where B is the 
number of bits used to represent the tap weights and Cmax is the 
maximum tap value. If the equalized signal is assumed to have unity 
power, then C ma x * 1/[(2N + D<r 2 „)] 1/2 and a is typically 10 on the 
order of 1/ 1 (2N + 1) • <r 2 > |. Thus, b 2 / \ r 2 (nT) | will be on the order of 
2~ 2B (2N +1), and when the equalized output signal power is unity, 
Eo P i is roughly the inverse of the output signal-to-noise ratio. With 
typical parameters like (2N + 1) = 60, B = 12, and £ op t = 0.001, it is 
clear that the effect of the bias on q^ is negligible. Thus, owing to the 
quantizing bias, there can be, on the average, a buildup of one or more 
large-tap weights, while the mse is relatively unaffected. In other 
words, under the influence of a bias the taps would still remain on the 
boldface mse contour of Fig. 4, but would spend most of the time near 
the shaded region, and the system would be subject to random over- 
flows, or hits. This phenomenon has been repeatedly observed exper- 
imentally, and in the next section we will describe a very simple means 
of controlling the tap wandering. 



* When the taps are at their optimum values, the error is known to be uncorrected 
with the received samples. 
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IV. THE TAP-LEAKAGE EQUALIZER ADJUSTMENT ALGORITHM 

As we have discussed in the previous section, some or all of the tap 
weights in an fse can reach unacceptably large values when the 
conventional tap adjustment algorithm, (14), is used. A simple means 
of controlling large-tap buildup is by miriimizing either of the aug- 
mented cost functions 

N 

Ji = E + n J c 2 , (30a) 

i — N 
N 

J 2 = E + tx I \a\, (30b) 

i — N 

where, as in (3), E is the mse and fi is a suitably chosen (small) 
constant. The cost function J\ ascribes a quadratic penalty to the 
magnitude of the tap vector, while J% provides a magnitude penalty, 
i.e., the cost function is penalized whenever the tap vector builds up 
excessively. Since the taps are to be adjusted adaptively, we cannot 
interpret p. as a Lagrange multiplier. The use of a Lagrange multiplier 
would be appropriate if we were actually able to niinimize J in a 
deterministic manner by using the true gradient. However, since the 
gradient of E with respect to c, Ac — x, is not available, we must 
implement a stochastic algorithm analogous to (14). Thus, /x must be 
chosen beforehand by using some prior knowledge of the system 
parameters. 

4. 1 Increased steady-state mse: true-gradient algorithm 

As a preliminary calculation, let us first consider the degradation in 
the minimum attainable steady-state mse caused by choosing c to 
niinimize J\ instead of E. Note that, for the moment, we are neglecting 
the bias and only assessing the increased mse caused by minimizing 
the augmented cost function, Ji, via the true-gradient algorithm. From 
(4) we observe, for binary transmission, that the taps will attempt to 
minimize the modified criterion: 

J x = c'(A + [il)c - 2c'x + 1. (31) 

= c'Bc - 2c'x + 1, (32) 

where B = A + yJ. The matrix B has the same eigenvectors as A, 
while the eigenvalues of B are A + /a. Note that the contours of equal 
values of J\ are still ellipses but the maximum-to-minimum eigenvalue 
ratio governing the tap wandering is now (A ma x + ju)/(A min + /*), where 
Amax and Amin are the maximum and minimum eigenvalues of A, 
respectively. Thus, by choosing fi properly, the eccentricity can be 
controlled, and the equalizer tap vector is now determined as if the 
noise power were increased from a 2 to a 2 + /a. Of course, the use of a 
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tap vector selected on the basis of the pseudo-noise power, ct 2 + n, will 
increase the steady-state mse.* Note that the steady-state tap vector 
will now satisfy 

Bc = x 

or 

c(/t) = B~hi = (A + pi)- 1 *, (33) 

where c(/i) denotes the steady-state tap vector corresponding to the 
chosen value of fi. Thus, the minimum attainable mse is given by 

E(fi) = c»Ac(ju) - 2c»x + 1, (34) 

and the increased mse, E(fi) — E opU is 

E(n) - Eo P t = [c(/a) - c 0P t]'A[c(/i) - Copt]. (35) 

To make a more detailed evaluation of the increase in mse, we let 
c(/i) = c(/x) - Copt (36) 

denote the tap-error vector. Recalling the diagonalization 

A= X tapf, (37) 

and the fact that inverse matrices have the same eigenvectors and 
inverse eigenvalues, we find that 

€(/x) - c(m) - Copt = [(A + fiiy 1 - A _1 ]x 



t A,(A, + /i) 

Substituting (38) into (35), and using the orthogonality property of 
distinct eigenvectors, we find that 

£( M )-Eo P t = M 2 i (P'^ Val,,^ (39) 



i-=-N 



MA.- + /0 2 ' 



Thus, to a first approximation, the increased mse grows only as the 
square of the leakage parameter, /i, while the eigenvalue distribution— 
and the range in which the taps can wander— can be favorably altered, 
in a significant manner, by using even a very small value of /i. 



* The fluctuation about the minimum mse, caused by the finite step size, will also be 
examined. 
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4.2 The adaptive tap-leakage algorithm 

In a manner analogous to the commonly used estimated gradient 
algorithm, the adaptive tap-leakage algorithms are constructed, from 
(30a), by minimizing the augmented instantaneous squared error, 
el + nc' n c n , and, from (30b), by minimizing el + [i JjL-n \Cn } \. The first 
algorithm modifies the gradient by a term proportional to the tap 
vector itself, giving the algorithm 

c n +i = c„ - a[e„r„ + /ac„] 

= (1 - a/i)c„ - ae n r n , (40a) 

while the second algorithm is of the form 

c„+i = c„ — a(e n r n + ft sgn c„) 

= c„ - ajx sgn c„ - ae n r n , (40b) 

where the sgn operation is applied individually to each component of 
the tap vector. Note that from an implementation point of view, the 
algorithm can be modified with almost no hardware change other than 
applying a systematic decrement to the magnitude of each tap. The 
second algorithm (40b) has the practical advantage that adjustments 
will continue to be made no matter how small any tap weight becomes, 
while the first algorithm has the "advantage" of analytical tractability. 
Consider now the mean tap error when the leakage algorithm (40a) 
is used. In the presence of digital bias, the algorithm is modeled as 

c„+i = c„ - a[e n r n + b + /ic„]. (41) 

Subtracting c op t from both sides of (41), and solving for the steady- 
state average tap error we find 

<€> = (A + nD^h +(A+ iiI)- l iiA-^s. 

_ I *„ + „ i P* , ,42) 

i— N \i + H i—H Ai(A« + /X) 

The first term, on the right-hand side of (42), is similar to (22), but 
note that the eigenvalues have been modified to eliminate the cata- 
strophic effects that can accompany vanishingly small eigenvalues. 
The second term, which is proportional to the leakage parameter, is 
similar to (39), and represents the increased tap error caused by the 
minimization of J\ and not E. The leakage parameter, /a, must be 
chosen sufficiently large so that the first term is properly controlled in 
magnitude, but not so large that the magnitude of the second term 
becomes appreciable. In general, the choice of p. is best done empiri- 
cally, but if [i is chosen to be in the range where Aminimum + /t « /i, then 
equating the magnitude of the two terms in (42) gives 
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1 (p/b) 2 

, — N A, 

For any given channel, (43) can be evaluated, but as far as the average 
tap error is concerned, it is sufficient to choose /i in such a way that 
the magnitude of (e) is within the nonshaded region of Fig. 4. From 
(42), one reasonable choice is to make y. roughly equal to the smallest 
value of A, for which p/x is significantly larger than A,. 

To assess the effect of the tap-leakage algorithm, (40a), on the 
steady-state mse we recall eq. (18) 

E n = E pt + Qn, 

where E op t is the minimum mse when both b and fi are zero. From (41) 
we can compute an approximation to the steady-state value, q, , in a 
manner similar to the computation of (28). For the adaptive tap- 
leakage algorithm we find that the fluctuation about the minimum 
mse is 

_ aX M [(2N + 1) • (r 2 (nT) )E opl + b'b] + 2a M b'x + ^V^" 1 * 
~" ~ 2X + n(l - aX) - (x[X M (2N + 1) (r 2 (nT) > + fi 2 ] 

Obviously, (44) is very channel dependent, but when fj. is chosen on 
the order of a small eigenvalue, then the fluctuation about the mini- 
mum mse is a rather insensitive function of the leakage parameter, 
while the magnitude of the mean tap error, (42), can be effectively 
controlled by the proper choice of p. 

In the next section we will discuss the results of laboratory experi- 
ments that use the tap-leakage algorithm to control the potentially 
unstable operation of a fractionally spaced equalizer. 

V. LABORATORY EXPERIMENTS 

For an experimental 9.6-kb/s data transmission system using 16- 
point quadrature amplitude modulation, the tendency of the tap 
coefficients of a digitally implemented equalizer with T/2 sample 
spacing to drift is demonstrated by the waveforms of Figs. 6a and 6b. 
These waveforms are analog representations of the values of one 
component of a set of complex tap coefficients associated with data 
samples taken T(= 1/2400) second apart. The equalizer is physically 
constructed by using two conventional T-spaced interleaved struc- 
tures, requiring four tap coefficient component distributions to totally 
describe the state of the equalizer. 

The equalizer goes through a conventional start-up procedure, ex- 
cept that timing recovery and carrier phase adjustments are suppressed 

TAP-LEAKAGE ALGORITHM 1833 




Fig. 6 — Tap buildup as a function of time, (a) Three-minute intervals, (b) Five-minute 
intervals. 



so that the distribution of tap coefficients will not change through 
interaction with either of these operations. The transmitted signal is 
fed back to the receiver, after passing through appropriate attenuators, 
thus avoiding any time varying channel characteristics, and assuring 
a low noise environment. 

The first trace of Fig. 6a illustrates the distribution of components 
among the particular collection of coefficients immediately after start- 
up. A single large negative component is noted, with all other compo- 
nents relatively insignificant. The coefficients are updated in the usual 
fashion, via (14), without the addition of a tap-leakage adjustment. 
Subsequent traces in Fig. 6a are taken at 3-minute intervals. The 
traces of Fig. 6b are a continuation of Fig. 6a with the separation in 
time extended to five minutes. A clear pattern of buildup in the 
amplitude of the taps, particularly those immediately preceding the 
original dominant tap, is demonstrated. A similar compensatory 
buildup occurs among those tap coefficient components not displayed, 
such that the mse is essentially unchanged over the duration of the 
test. This deterministic growth of tap amplitudes will eventually lead 
to saturation of shift register accumulators used in forming the various 
components of the equalizer passband outputs. The observed output 
signal constellation will display frequent apparent noise-like hits of 
large amplitude, and an unacceptable output error rate results. 

The tap-coefficient components in the laboratory configuration are 
stored in 24-bit shift registers. The 12 most significant bits are used in 
the multiplication to form tap-product outputs. The remaining bits 
were to average out the effects of tap updating, which is normally done 
according to the rule 

Cn+i = c„ — a(e n r„) . 

The components of the updating quantity — a(e„r„) are stored in 12-bit 

1834 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1 982 



words which are added to the 20 most significant bits of the coefficients 
during the normal steady-state mode of operation. To counteract bias 
in the arithmetic, the updating is changed to the tap-leakage algorithm 

c„+i = c„ - ft sgn c„ - a(e n r n ) , (45) 

where ft = a/x of (40b). In the experimental setup, a count of 1 is added 
or subtracted to the 23rd most significant bit of each component of 
each tap coefficient once each symbol interval. In steady-state opera- 
tion, the 12-bit updating signal will typically show activity in a mini- 
mum of the five least significant bits. In general, therefore, the leakage 
term is quite small compared with the conventional updating term. 

The effect of introducing this leakage is shown in the waveforms of 
Fig. 7. The top trace shows the coefficient distribution 40 minutes after 
initial start-up, at the time the leakage is enabled. Subsequent traces 
were taken at 30-second intervals. Within two minutes the coefficient 
components had been virtually restored to the state that existed 
immediately after start-up. 

The experimental arrangement allows the leakage to be scaled over 
a wide range. Viewing the 24-bit coefficient as an integer, the leakage 
increment can be made 2 r , r = 0, 1, • • • , 7 (r = 1 is the case displayed). 
Since the coefficients are chosen to span an analog range of ±4, this 
corresponds to ft as defined in (45) ranging from 2" 21 to 2" u , with ft = 
2" 20 displayed. The value of a used in the experiment is a = 2" 11 . It is 
observed that ft = 2" 21 , the lowest possible level of continuous leakage, 
is adequate to suppress tap drift, indicating the extreme low level of 
the system bias to which the fse updating algorithm appears suscep- 
tible. 

The larger the value of ft that is chosen, the more the leakage will 
degrade the equalizer performance, although the degradation is negli- 
gible for ft less than 2" 17 . In normal data-set operation, it is observed 




TAP VALUES — — 

Fig. 7— The effect of the tap-leakage algorithm (30-second intervals between traces). 
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that a substantially rapid shift in the sampling epoch, which may occur 
during the timing recovery operation, will greatly accelerate the 
buildup of tap coefficients. The choice of ft = 2 -18 will allow rapid 
response to this situation without perceivable performance degrada- 
tion. 

VI. CONCLUSIONS 

Effective control of tap drifting for a fractionally spaced equalizer, 
at a 9.6-kb/s data rate, has been demonstrated by employing the easily 
implemented tap-leakage algorithm. The tap-leakage algorithm, or 
some variation of it, might be appropriate for any digitally imple- 
mented adaptive system that has too many degrees of freedom and 
that exhibits coefficient wandering. 

APPENDIX 

Asymptotic Distribution of the Eigenvalues and Eigenvectors for 
Synchronous and Fractionally Spaced Equalizers 

In this appendix we describe the eigenvalues and eigenvectors of 
infinitely long synchronous and fractionally spaced equalizers. 

A. 1 Synchronous equalizer 

First we recall the eigenvalues and eigenvectors of an infinitely long 
synchronous equalizer. From (7) we have the eigenvalue equation 

N 

£ A k - lPl = \p k , N^k^N, (46) 

/ — N 

where A is an eigenvalue and p' = (ps, • • • , po, • • * , Pn) is the 
associated eigenvector. As N — * oo, taking the Fourier Transform of 
both sides of (46) yields 

A(o>)P(u>) = \P(u>), |u|<=^, (47) 



where 



A(«) = 



w¥ 



2 
+ 2 



= |X eq (o;)| 2 + a 2 , |«|<^. (48)* 

The only way for (47) to be satisfied [with P(co) ^ 0] is for P(u) to be 
concentrated at a single frequency [unless A (cu) has the same value at 



* Recall that the Nyquist-equivalent spectrum X, q (w) is defined as Xea(<o) = 
2 X(u + k2n/T). 
* 
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more than one frequency]. If we let 

i it 

then the solution to (47) is 

A, = AM 

-N<i<N 

Pdu) ««(«-«,). (49) 

Thus, for a synchronous equalizer the asymptotic (iV-> oo) eigenvalues 
uniformly sample the folded- channel plus noise spectrum, and the 
eigenvectors are the corresponding sinusoids. 

A.2 Fractionally Spaced Equalizer 

Here the channel-correlation matrix, while symmetric, is not 
Toeplitz; thus Fourier Transform techniques do not yield the eigen- 
values and eigenvectors in the above short order. For convenience we 
consider the noiseless situation, and the eigenvalue equation becomes 

£ A(kT, IT')p(lT') = Xp(kT') -N<k<N, (50) 

/ — N 

where the channel- correlation matrix has elements 

A(kT, IT) = 2 x(mT - kT)x(mT - IT). (51) 

m 

With T = T/2 and N -> oo we write (50) for even and odd values 

I/(»H*(<-rhs/(44M^ 

= \p(k^),keveix (52) 

= \p(k?\,kodd. (53) 
Now (52) and (53) can be written respectively as 
I A(kT, IT)p(lT) +Y l A(kT,lT)+^p [IT + |j 

= Xp(kT), -oo < k < oo (54) 
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and 



(55) 



v( kT+ l iT ) pm+ v{ kT+ l iT+ l) p { i l) 

where both equations hold for all integer values of k, and, more 
importantly, the various component matrices are now all Toeplitz.* If 
we let 

X«(i*)AX(«)-xL-^\-xL + ^\ |<o|<^ (56) 



and 

PMAPM-pL^-pL + y)' H*f (57) 

then taking the synchronous Fourier Transform (i.e., with respect to 
the T seconds sampling interval) of (54) and (55) gives 

1-XeqM | 2 P(w) + X eq (u>)X* eq (u)P(io) = AP(<o) < | (0 | < ^ (58) 

and 

X eq (co)A- e * q (w)P(co) + l^eq(cd) | 2 P(o>) = AP(to). (59) 

Note that p(£T + T/2) has the Fourier Transform 

_• T 

while the Transform of p(kT) is of course P(o). Arguing as we did for 
the synchronous equalizer, we see that the ith eigenvectors Pt(u) and 
Pi(co) must again be delta functions at on = i/N tt/T. Upon setting the 
determinant of the pair (58) and (59) to zero, we see that the eigen- 
values satisfy 

A, 2 - K\ \X eq M | 2 + \ZnM | 2 | = 0, (60) 

and thus for each value of to, there are two eigenvalues 
A, = 

2 



A,-=|X eq ( W| )| Z +|Xe q (cO I )| 2 = £ 



* 



k2n 
I Wi + — 



(61) 



In contrast to the synchronous equalizer, half of the eigenvalues are 



* For example, A (kT, IT+ T/2) = ^ x(mT- kT)x(mT- IT + T/2) = V„ x(nT)x\nT 
+ (k - l)T + T/2\. ' 
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exactly zero, while the other half are samples of the aliased magnitude- 
squared channel transfer function. Not surprisingly, the eigenvalues 
are independent of the receiver sampling phase. Once the eigenvalues 
are determined we can solve for the eigenvectors. The ith eigenvector 
associated with the zero eigenvalue is 

XZjMe***, n even 

( 1> W (62) 

juA n+-]T 

-XUMe v 2 > , n odd, 
while the eigenvector associated with the nonzero eigenvalue is 

X^Me ju ' nT , n even 

X eq Me J ""\" 2r , nodd. 



P'\ n ^\ 



Pi\*2i m 



mu4)t < 63 > 



At this point we remark that when co« is not in the rolloff region then 
Xeq(ui) = X eq M, and (63) describes a sinusoid of frequency at, since 
the even and odd portions of pfyi T/2) mesh together in a continuous 
manner [i.e., 

Pi (n T/2) = X eci Me ju - nT/2 l 

However, (62) describes a function that changes sign and oscillates 
almost a full cycle in T seconds. Consequently, pt(n T/2), as given by 
(62), will have most of its spectral energy concentrated near l/T Hz. 
When o)i is in the rolloff region, the frequency content of (62) and (63) 
will differ somewhat from the above extreme cases but the general 
results will still be as above. 
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